Interactive Speech and Noise Modeling for Speech Enhancement

Abstract

Speech enhancement is challenging because of the diversity of background noise types. Most existing methods focus on modelling the speech rather than the noise. In this paper, we propose a novel idea to model speech and noise simultaneously in a two-branch convolutional neural network, namely SN-Net. In SN-Net, the two branches predict speech and noise, respectively. Instead of information fusion only at the final output layer, interaction modules are introduced at several intermediate feature domains between the two branches so that they benefit each other. Such an interaction can leverage features learned from one branch to counteract the undesired part and restore the missing component of the other, and thus enhance their discrimination capabilities. We also design a feature extraction module, namely residual-convolution-and-attention (RA), to capture the correlations along the temporal and frequency dimensions for both the speech and the noises. Evaluations on public datasets show that the interaction module plays a key role in simultaneous modeling and that SN-Net outperforms the state-of-the-art by a large margin on various evaluation metrics. The proposed SN-Net also shows superior performance for speaker separation.
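The two-branch interaction described in the abstract can be illustrated with a toy sketch. The abstract does not specify how the interaction module is built, so everything below is an assumption made for illustration: the sigmoid gate, the scalar weights `w_own`, `w_other`, and `bias`, and the function names `interact` and `two_branch_step` are hypothetical, and plain Python lists stand in for feature maps. It shows only the general idea that each branch is refined by a gated share of the other branch's features at an intermediate stage, not the authors' implementation.

```python
import math


def sigmoid(x):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def interact(own, other, w_own=1.0, w_other=1.0, bias=0.0):
    """Refine `own` features with a gated share of `other` features.

    Hypothetical form: a per-element gate is computed from both
    branches, then the gated `other` feature is added to `own`.
    """
    refined = []
    for a, b in zip(own, other):
        gate = sigmoid(w_own * a + w_other * b + bias)  # value in (0, 1)
        refined.append(a + gate * b)
    return refined


def two_branch_step(speech_feat, noise_feat):
    """Exchange information in both directions at one intermediate stage.

    Both updates read the original inputs, so neither branch sees the
    other's already-refined features within the same step.
    """
    new_speech = interact(speech_feat, noise_feat)
    new_noise = interact(noise_feat, speech_feat)
    return new_speech, new_noise


if __name__ == "__main__":
    speech, noise = two_branch_step([1.0, 2.0], [0.5, 0.5])
    print(speech, noise)
```

In a real network the scalar weights would be replaced by learned convolutional layers and the exchange would be repeated at several depths; the sketch keeps a single stage to make the bidirectional data flow visible.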

Similar resources

Speech/noise-dominant decision for speech enhancement

A novel method to reduce additive non-stationary noise is proposed. The proposed method requires neither the statistical assumption about noise nor the estimate of the noise statistics from any pause regions. The enhancement is performed on a band-by-band basis for each time frame. Based on both the decision on whether a particular band in a frame is speech or noise dominant and the masking pro...

Noise estimation for efficient speech enhancement and robust speech recognition

Different approaches of minima tracking based noise estimation algorithms are compared and modifications increasing their efficiency are proposed. Estimated noise is used by noise suppression algorithm that is a part of speech recognition system. Moreover, the algorithms are developed to be applied in feature extraction of Distributed Speech Recognition (DSR). Therefore we propose such modifica...

Inter-frame modeling of DFT trajectories of speech and noise for speech enhancement using Kalman filters

In this paper, a time-frequency estimator for enhancement of noisy speech signals in the DFT domain is introduced. This estimator is based on modeling the time-varying correlation of the temporal trajectories of the short-time (ST) DFT components of the noisy speech signal using autoregressive (AR) models. The time-varying trajectory of the DFT components of speech in each channel is modeled by a...

Speech/Noise-Dominant Decision regardless of SNR for Speech Enhancement

In speech enhancement, the decision between speech-dominant and noise-dominant is important for reducing noise and increasing intelligibility. This paper presents a speech/noise-dominant decision that is independent of SNR. In the proposed decision, the influence of noise is reduced by subtracting the noise component. The proposed method therefore decides accurately between speech-dominant and noise-dominant. Fro...

Adaptive speech enhancement for speech separation in diffuse noise

An adaptive enhancement method is proposed to improve recognition accuracy on the outputs of a blind speech separation (BSS) system based on adaptive decorrelation filtering (ADF) in diffuse noise. A divide-and-conquer strategy is taken to deal with the noise effects on both system adaptation and ADF outputs. First, fast noise compensation (NC) is performed for filter adaptation, forcing ADF to f...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v35i16.17710